








START
Taxonomy tags are ordered by weight and grouped by taxonomy (100)
Get a taxonomy (110)
Emit the high confidence smart summary starting phrase (130)
Emit the low confidence smart summary starting phrase
Emit the high confidence smart summary phrase associated with the concept node and the tag
Emit a period and a space (180)

FIG. 3
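
The flow of FIG. 3 can be sketched in Python as follows. This is only an illustration under stated assumptions: the `Tag` record, the two starting phrases, and the weight threshold used to choose between the high and low confidence branches are all hypothetical, since the decision step in the figure is not legible. The sketch also emits one phrase per taxonomy, taken from its heaviest tag, which is a simplification.

```python
from collections import namedtuple

# Hypothetical tag record; the field names are assumptions for this sketch.
Tag = namedtuple("Tag", "taxonomy concept_node weight phrase")

HIGH_START = "This document is about "      # assumed wording
LOW_START = "This document may be about "   # assumed wording


def smart_summary(tags, threshold=0.5):
    """Build a summary string from weighted taxonomy tags."""
    # Order tags by weight (heaviest first), then group them by taxonomy.
    # Dicts keep insertion order, so taxonomies appear by their top weight.
    groups = {}
    for tag in sorted(tags, key=lambda t: t.weight, reverse=True):
        groups.setdefault(tag.taxonomy, []).append(tag)

    parts = []
    for group in groups.values():
        top = group[0]  # heaviest tag in this taxonomy
        # Assumed decision rule: compare the top weight with a threshold.
        parts.append(HIGH_START if top.weight >= threshold else LOW_START)
        parts.append(top.phrase)  # phrase tied to the concept node and tag
        parts.append(". ")        # emit a period and a space
    return "".join(parts)


tags = [
    Tag("Regions", "Europe", 0.2, "Europe"),
    Tag("Topics", "Databases", 0.9, "database systems"),
]
summary = smart_summary(tags)
```

With the sample tags, the heavier "Topics" tag is emitted first with the high confidence starting phrase, and the "Regions" tag follows with the low confidence one.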







START
Converting the document into a standard format
Add known taxonomy tags and meta-data tags to the content's list of tags
Markup the content structure
Spot entities within the context
Spot technical terms within the context






START
Collect the Generation Corpus (902)
Collect the Taxonomy Root Concept-Nodes (904)
Convert the Generation Corpus into XML Documents (906)
Root Concept-Node Collection and Input (908)
Identify and Input the Generation Corpus (910)
Perform Term Analysis (920)
Identify Irrelevant Root Concept-Nodes (922)
Any Taxonomy Assigned to a Small Number of the Term Features? (Y/N)
Remove Concept-Node from the Input List (924)
Remove at Least One High Cross-Correlation Root Concept-Node (928)

FIG. 9a
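
The two pruning steps of FIG. 9a (dropping root concept-nodes with few term features, then dropping one node of each highly correlated pair) can be sketched as below. The figure does not say how cross-correlation is measured, so Jaccard overlap of term sets is swapped in as a stand-in; the function names, the thresholds `min_terms` and `max_overlap`, and the rule of removing the node with fewer terms are all assumptions.

```python
from itertools import combinations


def jaccard(a, b):
    """Overlap of two term sets; a stand-in for cross-correlation."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def prune_root_nodes(node_terms, min_terms=2, max_overlap=0.5):
    """node_terms maps a root concept-node name to its set of term features."""
    # Remove nodes assigned to only a small number of term features.
    kept = {n: t for n, t in node_terms.items() if len(t) >= min_terms}
    # For each highly correlated pair, remove the node with fewer terms
    # (the second node of the pair on a tie).
    for a, b in combinations(sorted(kept), 2):
        if a in kept and b in kept and jaccard(kept[a], kept[b]) > max_overlap:
            weaker = a if len(kept[a]) < len(kept[b]) else b
            del kept[weaker]
    return kept
```

For example, given nodes "Finance" and "Banking" that share most of their terms, only the one with the larger term set survives, while an unrelated node such as "Sports" is untouched.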



Replace at Least Two High Cross-Correlation Root Concept-Nodes with Abstract Root Concept-Nodes

FIG. 9b



Review Term Weighting by an SME (938)
Cluster Documents for Each Taxonomy (940)
Generate a Report for Each Cluster (942)
Create DAG (944)
Perform 1st Order Taxonomy Naming (946)
SME Classifies Each Taxonomy (948)
Generation Corpus Manually Tagged (950)
A Text Classifier is Trained on a Subset of the Data (952)
Each Node is Tested (954)
Taxonomy Improvement is Initiated (956)
Each Node is Tested (958)
Concept Nodes Improvement is Initiated
Taxonomy Reviewed by an SME




START
Input A, where A is a set of clusters, C
Identify sufficiently similar clusters, C_j
Place C_j and C_i in partition, S
Pick a cluster C_i in S
Find all clusters C_j that are similar to C_i
Make an edge from C_i to every similar C_j
Find an unconnected or multiple root node
Add a node, RS, and add an edge from RS to each parentless node

Reference numerals shown: 9000, 9005, 9010, 9050, 9060, 9070, 9080, 9090

FIG. 9d
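
A minimal Python sketch of the graph construction in FIG. 9d follows. The figure does not state which way an edge points between two similar clusters, so this sketch assumes edges run from the larger cluster to the smaller one, which also guarantees the result is acyclic. The function name, the `similar` predicate, and the use of term sets to represent clusters are hypothetical; only the synthetic root node "RS" and its edges to parentless nodes come from the figure.

```python
def build_dag(clusters, similar):
    """clusters maps a name to a set of terms; similar(a, b) returns a bool.

    Returns a dict mapping each parent to its list of children, including
    a synthetic root "RS" linked to every parentless node.
    """
    # Assumed ordering: larger clusters first, names break ties.
    names = sorted(clusters, key=lambda n: (-len(clusters[n]), n))
    edges = {n: [] for n in names}
    has_parent = set()
    # Edges only run from earlier to later names, so no cycle can form.
    for i, parent in enumerate(names):
        for child in names[i + 1:]:
            if similar(clusters[parent], clusters[child]):
                edges[parent].append(child)
                has_parent.add(child)
    # Add a node RS and an edge from RS to each parentless node.
    edges["RS"] = [n for n in names if n not in has_parent]
    return edges
```

Clusters that are similar to nothing else end up directly under RS, so the result always has a single root.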




Find all clusters C_i that are not similar to any other clusters. For each C_i, find all graphs G_k that are similar to the C_i
Make an edge from C_i to the root of each G_k
Add a node to connect all disconnected clusters. Connect clusters to G_k
Add an edge from the root concept node to each parentless node

Reference numerals shown: 9110, 9120, 9130, 9140, 9150, 9170







